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(57) A scheme for automatic data conversion defi- 
nition generation based on data feature such as a deci- 
sion tree or a statistical feature, so as to enable a quick 
data analysis in a visual multidimensional data analysis 
tool In an apparatus for converting data stored in data- 
base or fHes jnto graphic data according to the data con- 
version definition and displaying the graphic data, a def- 
inition generation assistance device for automatically 
generating the data conversion definition is provided, 



where the definition generation assistance device ex- 
tracts a data feature of the data from a scheme and con- 
tents of the database or files, and automatically gener- 
ates the data conversion definition according to the ex- 
tracted data feature, the data conversion definition be- 
ing formed by an attribute mapping definition defining 
combinations of data attributes and graphic data param- 
eters and a data conversion method definition defining 
a method for converting a value of each data attribute 
into a value of a corresponding graphic data parameter. 
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Description 

BACKGROUND OF THE INVENTION 

FIELD OF THE INVENTION 

The present invention relates to a scheme for auto- 
matically generating graphic expressions according to 
data feature such as a decision tree for classifying target 
attributes or a statistical teature of data in a visual mul- 
tidimensional data analysis tool which is to be used in 
applying a sophisticated data analysis with respect to 
data stored in database or files, and more particularly, 
to a scheme for automatically generating data conver- 
sion definition according to the data feature which is to 
be used in a data conversion display device for convert- 
ing data stored in database or files into graphic data ac- 
cording to the data conversion definition and displaying 
converted graphic data. 

DESCRIPTION OF THE BACKGROUND ART 

In the conventional visual multidimensional data 
analysis tool, visualisation conditions are usually de- 
fined by a user and there is no known scheme for as- 
sisting this definition generation by the user according 
to a decision tree for classifying target attributes or sta- 
tistical feature of data. 

Thus in the conventional visual mullidimensional 
data analysis tool, visualization conditions have to be 
defined by the user so that many trials and errors are 
required at a time of data analysis, and there has been 
a problem that it has been impossible to carry out the 
data analysis quickly. 

Also, in the conventional visual multidimensional 
data analysis tool, visualization target attributes are to 
be seiected and obtained by the user by repeating at- 
tribute selection operations. 

In this regard, in the conventional visual muitidimen- 
sional data analysis tool there is no known scheme for 
automatically selecting target attributes by assigning 
priority orders among a plurality of attributes according 
to a correlation coefficient matrix, and it has been nec- 
essary for the user to seiect the visualization target at- 
tributes so that the user is required to repeat operations 
for selecting attributes that are effective for the data 
analysis and therefore there has been a problem thai it 
has been impossible to carry out the data analysis quick- 
W- 

SUMMARY OF THE INVENTION 

ft is therefore an object of the present invention to 
provide a scheme for automatic data conversion defini- 
tion generation based on data feature such as a decision 
tree or a statistical feature, so as to enable a quick data 
analysis in a visual multidimensional data analysis tool. 

ft is another object of the present invention to pro- 



vide a scheme for automatic data conversion definition 
generation, which is capable of assigning priority orders 
among a plurality of attributes according to a correlation 
coefficient matrix and selecting visualization target at- 

s tributes from data and a number of attributes to be se- 
iected entered by the user at a time of ^argc quantity 
multidimensional data analysis. 

According to one aspect oi the present invention 
there is provided an automatic definition generation de~ 

to vsce for automatically generating a data conversion def- 
inition to be used in an apparatus for converting data 
stored in database or files into graphic data according 
to the data conversion definition and displaying the 
graphic data, the device comprising: a feature extraction 
unit for extracting a data feature of the data from a 
scheme and contents of the database or files, and a def- 
inition generation unit for automatically generating the 
data conversion definition according to the data feature 
extracted by the feature extraction unit the data conver- 
ge S ion definition being formed by an attribute mapping def- 
inition defining combinations of data attributes and 
graphic data parameters and a data conversion method 
definition defining a method lor converting a value of 
each data attribute into a value of a corresponding 

2$ graphic data parameter 

According to another aspect of the present inven- 
tion there is provided an automatic definition generation 
method for automatically generating a data conversion 
definition to be used in an apparatus for converting data 

30 stored in database or files into graphic data according 
to the data conversion definition and displaying the 
graphic data, the meihod comprising the steps of: ex- 
tracting a data feature of the data from a scheme and 
contents of the database or files: and automatically gem 

35 erating the data conversion definition according to the 
data feature extracted by the extracting step, the data 
conversion definition bemg formed by an attribute map- 
ping definition defining combinations of data attributes 
and graphic data parameters and a data conversion 

40 method definition defining a method for converting a val- 
ue of each data attribute into a value of a corresponding 
graphic data parameter 

According to another aspect of the present inven- 
tion there is provided an article of manufacture, com- 

45 prising: a computer usable medium having computer 
readable program code means embodied therein lor 
causing a computer to function as an automatic defini- 
tion generation system for automatically generating a 
data conversion definition to be used in an apparatus 

53 lor converting data stored in database or flies into graph- 
ic data according to the data conversion definition and 
displaying the graphic data, the computer readable pro- 
gram code means includes: first computer readable pro- 
gram code means for causing said computer to extract 

55 a data feature of the data from a scheme and contents 
of the database or files; and second computer readable 
program code means for causing said computer to au- 
tomatically generate the data conversion definition ac- 
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cording to the data feature extracted by the first compu- 
ter readable program code means the data conversion 
definition being formed by an attribute mapping defini- 
tion defining combinations of data attributes and graphic 
data parameters and a data conversion method defini- 
tion defining a method for converting a value of each 
data attribute into a vaiue of a corresponding graphic 
data parameter. Other features and advantages of the 
present invention will become apparent from the follow- 
ing description taken in conjunction with the accompa- 
nying drawings 

BRIEF DESCRIPTION OF THE DRAWINGS 

Fig. 1 is a block diagram of a data conversion dis- 
play apparatus using a decision tree based definition 
generation assistance device according to the fust em- 
bodiment of the present invention. 

Fig. 2 is a block diagram showing an interna! con- 
figuration of a decision tree constraint condition gener- 
ation unit in a decision tree based definition generation 
assistance device in the apparatus of Fig 1 . 

Fig. 3 is a table showing a data priority order as- 
srgnment rule used in a decision tree based definition 
generation assistance device in the apparatus of Fig. 1 . 

Fig. 4 is a table showing an expression condition 
used in a decision tree based definition generation as- 
sistance device in the apparatus of Fig. 1 . 

Fig 5 is a block diagram showing an internal con- 
figuration of a definition combination generation unit in 
a decision tree based definition generation assistance 
device in the apparatus of Fig. 1 . 

Fig. 6 is a flow chart for a data conversion definition 
generation processing by a decision tree based defini- 
tion generation assistance device in the apparatus of 35 
Fig. t 

Fig. 7 is a diagram showing an exemplary process- 
ing by a decision tree constraint condition generation 
unit in a decision tree based definition generation as- 
sistance device in the apparatus of Fig. 1 . 40 

Fig. S is a diagram showing an exemplary process- 
mg by a definition combination generatron unit in a de- 
cision tree based definition generation assistance de- 
vice in the apparatus of Fig 1 

Fig. 9 is a diagram showing an outline of a data con- 45 
version processing by a decrs^on tree based definition 
generation assistance device in the apparatus of Fig. 1 . 

Fig. 10 is a block diagram of a data conversion dis- 
play apparatus using a statistical feature based defini- 
tion generation assistance device according to the sec- so 
ond embodiment of the present invention. 

Fig 11 is a block diagram showing an internal con- 
figuration of a statistical constraint condition generation 
unit in a statistical feature based definition generation 
assistance device in the apparatus of fig. 10, 55 

Fig. 12 is a table showing a data priority order as- 
signment rule used in a statistical feature based defini- 
tion generation assistance device in the apparatus oi 



Fig. 10, 

Fig. 13 is a table showing an expression condition 
used in a statistical feature based definition generation 
assistance device in the apparatus of F^g. 10. 
5 Fig. 1 4 is a block diagram showing an internal con- 

figuration of a definition combination generation unit in 
a statistical feature based definition generation assist- 
ance device in the apparatus of Fig. 10. 

Fig 1 5 is a flow chart for a data conversion definition 
to generation processing by a statistical feature based def- 
inition generation assistance device in the apparatus of 
Fig. 10. 

Fig 1 6 is a diagram showing an exemplary process- 
ing by a statistical constraint condition generation unit 
15 in a statistical feature based definition generation assist- 
ance device in the apparatus of Fig. 10. 

Fig, 1 7 is a diagram showing an exemplary process- 
ing by a definition combination generation unit in a sta- 
tistical feature based definition generation assistance 
20 device in the apparatus of Fig. 10 

Fig 18 is a diagram showing an outline ot a data 
conversion processing by a statistical feature based def- 
inition generation assistance device in the apparatus of 
Fig. 10. 

Fig. 19 is a diagram showing a correlation coeffi- 
cient matrix used in the third embodiment of the present 
invention. 

Fig. 20 is a flow chart for an attribute priority order 
assignment method according to the third embodiment 
of the present invention for a case using a maximum 
correlation. 

Fig. 21 is a flow chart for an attribute priority order 
assignment method according to the third embodiment 
of the present invention for a case using ali correlation 
coefficients comprehensively. 

Fig. 22 is a flow chart for an attribute priority order 
assignment method according to the third embodiment 
of the present invention for a case using limited corre- 
lation coefficients 

Fig 23 is a diagram showing an exemplary data in 
a case of obtaining a discrete index used in the third 
embodiment of the present invention. 

Fig 24 is a ffow chart for an attribute priority order 
assignment method according to the third embodiment 
of the present invention for a case using a discrete in- 
dex. 

Fig. 25 es a flow chart for an attribute priority order 
assignment method according to the third embodiment 
of the present invention for a case merging a maximum 
correlation based case and ail correlation coefficrents 
based case. 

DETAILED DESCRIPTION OF THE PREFERRED 
EMBODIMENTS 

Referring now to Fig. 1 to Fig. 9, the first embodi- 
ment of an automatic data conversion definition gener- 
ation scheme according to the present invention will be 
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described in detail 

Fig 1 shows a configuration of a data conversion 
display apparatus using a decision tree based definition 
generation assistance device according to this first em- 
bodiment, in the data conversion display apparatus of 
Fig. 1 , data stored in a data source such as database 
or flies 1 00 are converted into graphic data according to 
a data conversion definition 101 at a data conversion 
device 102, and the converted graphic data are dis- 
played at a display device 1 03. The data conversion def- 
inition 101 is conventionally to be defined by the user 
through a definition input GUI (Grapicai User Interface) 
device 104, but in this first embodiment, the data con- 
version definition 101 is to be generated automatically 
according to a data feature at a decision tree based def- 
inition generation assistance device 107 In addition, it 
is possible for the user to change the automatically gen- 
erated data conversion definition 101 through the defi- 
nition input GUI device 104 in this data conversion dis- 
play apparatus of Fig, 1 Note that the data conversion 
definition 101 is an information that defines how data 
stored in the database or files 100 are to be converted 
into graphic data by the data conversion device 102 

As shown in Fig 1, the decision tree based defini- 
tion generation assistance device 107 comprises a de- 
cision tree constraint condition generation unit 105 and 
a definition combination generation unit 106. 

As shown in Fig. 2, the decision tree constraint con- 
dition generation unit 105 further comprises a decision 
tree generation unit 201 that generates a decision tree 
for classifying target attributes from scheme and con- 
tents of the database or files 100 : and a combination 
condition generation engine 203 that generates candi- 
dates lor a combination condition on attribute mapping 
from a decision tree feature data 202 of the decision tree 
generated by the decision tree generation unit 201 Note 
that the decision tree is a well known tree-like classifier 
generated by sequentially selecting attributes that are 
most effective in classifying records, according to values 
of target attributes which are classification targets en- 
tered by a user. 

Here, the generation of the combination condition 
in the combination condition generation engine 203 is 
carried out according to a data priority order assignment 
rule shown in Fig. 3, that is. a data priority order assign- 
ment rule for assigning prionty orders to data conversion 
target attributes or values according to the decision tree 
feature data 202 

In this first embodiment, the decision tree feature 
data is given by vertices of the decision tree, and the 
data prionty order assignment rule assigns higher prior- 
ity orders from upper level vertices, or from lower levei 
vertices. Note that the combination condition on at- 
tribute mapping is an information that indicates an or- 
dered sequence of attributes obtained according to this 
data prionty order assignment rule. 

As shown in Fig. 5, the definition combination gen- 
eration unit 106 further comprises an attribute mapping 



definition generation unit 302 for determining attributes 
to be used in the data conversion by combining the com- 
bination condition generated by the decision tree con- 
straint condition generation unit 105 and a pre-regis- 
5 tered expression condition 301 , and a data conversion 
method parameter automatic generation unit 303 for de- 
termining parameters corresponding to data conversion 
methods. 

The expression condition 301 is given as shown in 
io Fig 4, and an attribute mapping definition 101a and a 
data conversion method definition 101b that constitute 
the data conversion definition 101 are generated at the 
attribute mapping definition generation unit 302 and the 
data conversion method parameter automatic genera- 
ls tion unit 303, respectively, by combining the combina- 
tion condition generated according to the data priority 
order assignment rule in view of the decision tree feature 
data including the decision tree vertices and discrimina- 
tion values used in classifying records at the decision 
20 tree vertices as well as the expression condition deter- 
mined according to a data type of each data conversion 
target data. Here, the attribute mapping definition 101a 
is an information that defines which attribute is to be set 
in correspondence (which attribute is to be mapped) to 
25 which graphic data parameter, and the data conversion 
method definition 101b is an information that defines a 
method by which a value of each attribute is to be con- 
verted into a value of the mapped graphic data param- 
eter. 

30 Now, the data conversion definition generation 

processing by the decision tree based definition gener- 
ation assistance device 107 of this first embodiment is 
carried out according to the flow chart of Fig. 6 as fol- 
lows. 

35 First, analysis target data and target attribute spec- 

ification is entered (step S401), and data specified by 
this input is processed by a decision tree generation al- 
gorithm (step S402). 

Then, whether the generation of a decision tree was 
40 successful or not is checked (step S41 1 ). When the gen- 
eration of a decision tree was unsuccessful , an error out- 
put is made and the processing is terminated (step 
3412), whereas when the generation of a decision tree 
was successful, vertices of the decision tree are extract- 
's ed and rearranged in an order giving higher prionty or- 
ders for upper levei vertices or in an order giving higher 
priority orders for lower level vertices (step S403). Then, 
in a case where the obtained ordered sequence of at- 
tributes has the same attribute appearing more than 
50 once, lower levei ones among overlapping attributes are 
eliminated (step S404). 

Next, the data type of each data conversion target 
attribute is judged to be quantitative or quaHtative, se- 
quentially in an order of priority orders among attributes 
56 (step S413) Here, the quantitative data is data which 
has a value in a continuous range, and the qualitative 
data is data which has a value in a discrete range. 

When the data conversion target attribute is the 
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quantitative data, the attribute mapping definition for this 
data conversion target attribute is generated for a stiil 
unmapped graphic data parameter which is selected in 
an order of X-axis, Y-axis, size, color label and shape 
according to the expression condition for the quantita- s 
tive data as shown in Fig. 4 (step $405), and the data 
conversion method definition is generated by means of 
logical arrangements for X-axis and Y-axis and a thresh- 
old specification or a uniform division specification for 
the others (step S406). 10 

On the other hand, when the data conversion target 
attribute is the qualitative data, the attribute mapping 
definition for this data conversion target attribute is gen- 
erated for a stil) unmapped graphic data parameter 
which is selected in an order of X-axis, Y-axis, color ?5 
size, shape and label according to the expression con- 
dition for the qualitative data as shown in Fig. 4 (step 
S407), and the data conversion method definition is 
generated by means of logical arrangements for X-axis 
and Y-axis and a character string specification or a 20 
threshold specification for the others (step S408). 

Finally, the attribute mapping definition and the data 
conversion method definition generated at the steps 
S405 and S406 or the steps S407 and S408 are com- 
bined so as to generate the data converston definition 25 
101 (step S409). 

Note that the logical arrangement (relative projec- 
tion arrangement) is a method for arranging figures ac- 
cording todata attribute values in which figures with min- 
imum and maximum values for data attributes mapped 30 
to X-axis and Y-axis are arranged at edges of a screen 
and each figure is arranged at X-axis and Y-axis values 
that are proportional to corresponding data attribute val- 
ues. 

Also, the threshold specification is a method for 35 
converting data attribute values into graphic data pa- 
rameter values in which data attribute values are divided 
at arbitrary data attribute value used as a threshold and 
divided data attribute values are converted into different 
graphic data parameter values. This threshold specifi- 40 
cation can be defined for both quantitative data and 
qualitative data but may not be defined for some quali- 
tative data such as those given by character strings. In 
this embodiment, the discrimination value at a corre- 
sponding decision tree vertex is to be used as threshold 45 
in a case of the threshold specification. 

Also, the uniform division specification is a method 
for converting data attribute vaiues into graphic data pa- 
rameter values in which a range of data attribute values 
from a maximum vaiue to a minimum value is divided so 
into a prescribed division number of equal sections and 
data attribute values of different sections are converted 
into different graphic data parameter values This uni- 
form division specification is defined for the quantitative 
data . 55 

A!so. : the character string specification is a method 
for converting data attribute values into graphic data pa- 
rameter values in which each data attribute value is con- 



verted into a one-to-one corresponding graphic data pa- 
rameter value. This character string specification is de- 
fined lor the qualitative data 

Fig 7 shows an exemplary processing by the deci- 
sion tree constraint condition generation unit 1 05. In this 
example, the analysis target data are subjected to the 
decision tree generation processing by the decision tree 
generation unit 201 to yield the decision tree feature da- 
ta 202, and the combination conditions are generated 
by the combination condition generation engine 203 us- 
ing the data priority order assignment rule shown in Fig. 
3, according to thas decision tree feature data 202. m 
this case, method parameters to be used at the data 
conversion method parameter automatic generation 
unit 303 in the definition combination generation f 06 at 
the next stage are set to be data value, maximum value, 
minimum vaiue and discrimination value. 

Fig. 8 shows an exemplary processing by the defi- 
nition combination generation unit 106 In this example, 
the the attribute mapping definition and the data conver- 
sion method definition are generated by combining each 
combination condition and the expression condition 301 
of Fig. 4 determined by the data type of the data con- 
version target data which is selected according to each 
combination condition 

Fig. 9 shows an outline of the data conversion 
processing for converting data of the data source such 
as database or files 100 into graphic data according to 
the data conversion definifon 101 generated by the de- 
cision tree based definition generation assistance de- 
vice 107 of this first embodiment. 

As shown in Fig. 9, in the data conversion definition 
101 ? the attribute mapping definition 101a specifies at- 
tributes corresponding to label, size, color shape, X-ax- 
is and Y-axis of the graphic object (node type object), 
while the data conversion method definition 101b spec- 
ifies a conversion method for each attribute specified by 
the attribute mapping definition 101a. 

Referring now to Fig. 10 to Fig. 18 : the second em- 
bodiment of an automatic data conversion definition 
generation scheme according to the present invention 
will be described in detail. 

Fig. 10 shows a configuration of a data conversion 
display apparatus using a statistical feature based def- 
inition generation assistance device according to this 
second embodiment. In the data conversion display ap- 
paratus of Fig, 10 ; data stored in a data source such as 
database or files 1100 are converted into graphic data 
according to a data conversion definition 1-101 at a data 
conversion device 1 1 02, and the converted graphic data 
are displayed at a display device 1103. The data con- 
version definition 1101 is conventionally to be defined 
by the user through a definition input GUI (Grapicai User 
Interface) device 1104, but in this second embodiment 
the data conversion definition 1101 is to be generated 
automatically according to a statistical feature of data at 
a statistical feature based definition generation assist- 
ance device 1107. in addition, it is possible for the user 
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to change the automatically generated data conversion 
definition 1101 through the definition *nput GUI device 
1104 in this data conversion display apparatus of Fig. 
10 Here, the meaning of the data conversion definition 
is the same as in the first embodiment described above. 

As shown in Fig 10, the statistical feature based 
definition generation assistance device 11 07 comprises 
a statistical constraint condition generation unit 1105 
and a definition combination generation unit 1106. 

As shown in Fig. 11 , the statistical constraint condi- 
tion generation unit 1105 further comprises a statistical 
feature extraction unit 1201 that extracts a statistical 
feature of data from scheme and contents of the data- 
base or fiies 1100. and a combination condition gener- 
ation engine 1 203 thai generates candidates for a com- 
bination condition on attribute mapping from a statistical 
feature data 1202 generated by the statistical feature 
extraction unit 1201 . 

Here, the generation of the combination condition 
in the combination condition generation engine 1203 is 
carried out according to a data priority order assignment 
rule shown in Fig. 12, that is, a data priority order as- 
signment rule for assigning priority orders to data con- 
version target attributes or values according to the sta- 
tistical feature data 1202 

In this second embodiment, the statistical feature 
data is given by any of a factor a factor loading, a cor- 
relation coefficient (matrix), a cluster and a correlation 
coefficient (matrix), or their corresponding quantification 
theory results. Also, the data priority order assignment 
rule assigns a higher priority order from the first factor 
when the statistical feature is a factor or its correspond- 
ing quantification theory result, assigns higher priority 
orders from attributes contained in the first factor when 
the statistical feature is a factor loading or its corre- 
sponding quantification theory result, assigns higher pri- 
ority orders from a set of attributes with a higher corre- 
lation coefficient when the statistical feature is a corre- 
lation coefficient or its corresponding quantification the- 
ory result, or assigns higher priority orders from clusters 
and then a set of attributes with a higher correlation co- 
efficient when the statistical feature is a cluster and cor- 
relation coefficient or their corresponding quantification 
theory results, Note that the combination condition on 
attribute mapping is an information that indicates an or- 
dered sequence of factors, attributes and clusters ob- 
tained according to this data priority order assignment 
rule Note also that factor, factor loading^ correlation co- 
efficient, cluster and quantification theory are the well 
known concepts in the muitivanate analysis 

As shown in Fig. 1 4 ; the definition combination gen- 
eration unit 1 1 06 further comprises an attribute mapping 
definition generation unit 1 302 for determining attributes 
to be used in the data conversion by combining the com- 
bination condition generated by the statistical constraint 
condition generation unit 1105 and a pre-registered ex- 
pression condition 1 301 , and a data conversion method 
parameter automatic generation unit 1303 for determin- 



ing parameters corresponding to data conversion meth- 
ods. 

The expression condition 1301 is given as shown 
in Fig. 1 3, and an attribute mapping definition 1101a and 
£ a data conversfon method definition 1101b that consti- 
tute the data conversion definition 1101 are generated 
at the attribute mapping definition generation unit 13G2 
and the data conversion method parameter automatic 
generation unit 1303, respectively, by combining the 
10 combination condition generated according to the data 
priority order assignment rule in view of the statistical 
feature data and the expression condition determined 
according to a data type of each data conversion target 
data. Here, the meanings of the attribute mapping defi- 
es nition 1 1 0ta and the data conversion method definition 
1101b are the same as in the first embodiment de- 
scribed above. 

Now, the data conversion definition generation 
processing by the statistical feature based definition 

20 generation assistance device 1107 of this second em- 
bodiment is carried out according to the flow chart of 
Fig. 15 as follows 

First, analysis target data specification is entered 
(step 31401), and data specified by this input is proc- 

2S essed by a statistical processing (step S1402). 

Then, the statistical feature is extracted (step 
S1403), and data are rearranged according to the data 
priority order assignment rule (step St 404) 

Next the data type of each data conversion target 

so attribute is judged to be quantitative or qualitative, se- 
quentially in an order of priority orders among attributes 
(step SI 411 ). Here, the meanings of the quantitative da- 
ta and the qualitative data are the same as in the first 
embodiment described above. 

35 When the data conversion target attribute is the 

q uantitat ive data , th e att ribute mapping def init ion for this 
data conversion target attribute is generated for a still 
unmapped graphic data parameter which is selected in 
an order of X-axis, Y-axis, size, color label and shape 

40 according to the expression condition for the quantita- 
tive data as shown in Fig 1 3 (step SI 405). and the data 
conversion method definition is generated by means of 
logical arrangements for X-axis and Y-axis and a uni- 
form division specification or a threshold specification 

4S for the others (step S1 406), 

On the other hand, when the data conversion target 
attribute is the qualitative data, the attribute mapping 
definition for this data conversion target attribute is gen- 
erated for a still unmapped graphic data parameter 

so which is selected in an order of X-axis ; Y-axis, color, 
size, shape and label according to the expression con- 
dition for the qualitative data as shown in Fig. 13 (step 
S1407), and the data conversion method definition is 
generated by means of logical arrangements for X-axis 

s$ and Y-axis and a character string specification or a 
threshold specification for the others (step SI 408). 

Then, whether it is a threshold specification or not 
is judged (step SI 409), and when it is a threshold spec- 
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ificalion, a threshold is entered by a user (step S1410) 
Finally, the attribute mapping definition and the data 
conversion method definition generated at the steps 
SI 405 and S1406 or the steps S1407 and S140S are 
combined so as to generate the data converse defini- £ 
lion 1101 {step S1411). 

Here, the meanings of the logical arrangement, the 
threshold specification, the uniform division specifica- 
tion, and the character string specification are the same 
as in the first embodiment described above 10 

Fig. 18 shows an exemplary processing by the sta- 
tistical constraint condition generation unit 1105. in this 
example, the analysis target data are subjected to the 
statistical processing by the statistical feature extraction 
unit 1201 to yield the statistical feature data 1202 : and 
the combination conditions are generated by the com- 
bination condition generation engine 1 203 using the da- 
ta priority order assignment rule shown in Fig 12, ac- 
cording to this statistical feature data 1 202. In this case, 
method parameters to be used at the data conversion 20 
method parameter automatic generation unit 303 in the 
definition combination generation 106 at the next stage 
are set to be data value, maximum value, and minimum 
value. Note that, in Fig. 16 t! scofe21 " for example indi- 
cates a factor score for the second factor and the first 25 
sample rl, and "coef!36" for example indicates a corre- 
lation coefficient between the attributes a3 and a6 

Fig 1 7 shows an exemplary processing by the def- 
inition combination generation umi 1106 fn this exam- 
pie. the the attribute mapping definition and the data 30 
conversion method definition are generated by combin- 
ing each combination condition and the expression con- 
dition 1301 of Fig. 13 determined by the data type of the 
data conversion target data which is selected according 
to each combination condition. 35 

Fig. 18 shows an outline of the data conversion 
processing for converting data of the data source such 
as database or files 1100 into graphic data according to 
the data conversion definition 1101 generated by the 
statistical feature based definition generation assist- 40 
ance device 1107 of this second embodiment, 

As shown in Fig. 18, in the data conversion defini- 
tion 1101, the attribute mapping definition 1101a speci- 
fies attributes corresponding to labeL size. cofor : shape. 
X-axis and Y-axis of the graphic object (node type ob- 45 
\ec\h whiie the data conversion method definition 1101b 
specifies a conversion method for each attribute speci- 
fied by the attribute mapping definition 1101a. 

Referring now to Fig, 1 9 to Fig. 25 : the third embod- 
iment of an automatic data conversion definition gener- so 
ation scheme according to the present invention will be 
described in detail This third embodiment is directed to 
a case of using the correlation coefficient (matrix) as the 
statisticai feature in the second embodiment described 
above, and shows other examples of priority order as- &s 
signment method for data conversion target attributes 
based on the correlation coefficient. 

Fig. 19 shows a correlation coefficient matrix used 



in Xh\s third embodiment, in Fig T9, A 0 , , A n de- 
note attributes, and O m , , C n n denote correlation 

coefficients among attributes, where Cy is a correlation 
coefficient between attributes A } and Aj. 

in the following description of this third embodiment, 
the attribute priority order assignment method will be de- 
scribed for various cases including a case using a max- 
imum correlation, a case using all correlation coeffi- 
cients comprehensively, a case using limited correlation 
coefficients, a case us*ng a discrete index, and a case 
merging a maximum correlation based case and all cor- 
relation coefficients based case, 

Now. the first case of assigning priority orders to a 
plurality of attributes according to a largest correlation 
coefficient and correlation coefficients of a set of at- 
tributes having the largest correlation coefficient with re- 
spect to the other attributes in the correlation coefficient 
matrix among attributes will be described. 

Fig 20 shows a flow chart tor the attribute priority 
order assignment method using a maximum correlation 
in this third embodiment, which proceeds as follows. 

Step S2101: First, data are entered. 

Step 521 02: The correlation coefficient matrix is cal- 
culated. 

Step 32103: The highest priority order is assigned 
to a set of attributes A k and which has the largest 
correlation coefficient C k1 . 

Step S2104: The correlation coefficients C k1 , C k2 , 

C k1 , , C kn of the attribute A k with respect to 

the other attributes are multiplied with corresponding 

ones of the correlation coefficients C 1V C 12 , C 1h , 

C 1ft of the attribute A 1 with respect lo the other 

attributes, to obtain M 1 , M 2 , , M n flvlj - C ki x C ri ). 

Step S2105: M k and bA A are removed from M n t M 2 , 
= M n3 and then each one of remaining M is sequen- 
tially extracted in a decreasing order of their values. 

Step S2106: The subsequent priority order is se- 
quentially assigned to an attribute used in calculating 
each M ; in an order by which each M is extracted (When 
M t is extracted, the next priority order is assigned to an 
attribute Aj). 

Next the second case of assigning priority orders 
to a plurality of attributes by applying a numerical calcu- 
lation processing to the correlation coefficients of each 
attribute with respect to the other attributes and assign- 
ing a higher priority order to an attribute which has high 
correlations with all the attributes comprehensively will 
be described. 

Fig. 21 shows a flow chart for the attribute priority 
order assignment method using all correlation coeffi- 
cients comprehensively in this third embodiment which 
proceeds as follows. 

Step S2201; First, data are entered, 

Step S2202: The correlation coefficient matrix is 
calculated. 

Step S2203: For each one of the attributes A,, A 2 
: A n . a sum S v S 2 , , S n of correlation coeffi- 
cients with respect to the other attributes is obtained. 
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Here, the sum Sj of the correlation coefficients of an at- 
tribute Aj wiih respect to the other attributes is given as 
follows. 

s i " °ii + C i2 + * C in 

Step S2204: The highest priority order is assigned 
to an attribute A k which has the largest value S k among 
the obtained sums S 1s S 2: , S n 

Step S2205: Each one of the correlation coefficients 

C k1 , C k2 , , C kn of the attribute A k with respect to 

the other attributes is extracted sequentially in a de- 
creasing order of their values. 

Step S2206: The subsequent priority order is se- 
quentially assigned to an attribute used in calculating 
each correlation coefficient, in an order by which each 
correlation coefficient is extracted. 

Next, the third case in which a limitation based on 
a number of attributes to which the priority orders are to 
be assigned is added to the above described second 
case of using all correlation coefficients comprehensive- 
ly will be described. 

Fig 22 shows a flow chart for the attribute priority 
order assignment method using limited correlation co- 
efficients in this third embodiment, which proceeds as 
follows. 

Step S2301: First, data and a number m of at- 
tributes to which the priority orders are to be assigned 
are entered. 

Step S2302; The correlation coefficient matrix is 
calculated. 

Step S23G3: For each one of the attributes A 1 , A 2 , 
s A n! a sum S 1; S 2 > , S n of (m-1) larger cor- 
relation coefficients among correlation coefficients with 
respect to the other attributes is obtained 

Step S2304: The highest priority order is assigned 
to an attribute A k which has the iargest value S k among 
the obtatned sums S v S 2 , , S n . 

Step S2305: Each one of the correlation coefficients 

C k1 , C k2 ■ C kn of the attribute A k with respect to 

the other attributes is extracted sequentially in a de- 
creasing order of their values. 

Step S2306: The subsequent priority order is se- 
quentially assigned to an attribute used in calculating 
eace correlation coefficient, in an order by which each 
correlation coefficient is extracted 

Next, the fourth case of assigning priority orders to 
attributes by using a discrete index which is calculated 
from a number of attribute value types and a number of 
re co is and indicating a continuity of attribute value, in 
the above described second case of using all correction 
coefficient comprehensively will be described. 

Fig. 23 shows an exemplary data in a case of ob- 
taining a discrete index t to be utilized in this fourth case. 

For each attribute, the discrete index - is defined as 
a number of attribute value types divided by a number 
of records For an attribute A 1 in the exemplary data 



shown in Fig. 23. for example, the attribute value has 
three types of XXX. YYY and ZZZ. and a number of 
records is eight, so that the discrete index x 1 is given by 

3/8 : : 0.375 

s Fig 24 shows a flow chart for the attribute priority 

orderassignment method usingthe discrete index in this 
third embodiment, which proceeds as follows. 
Step S24D1 : First, data are entered. 
Step S2402: The correlation coefficient matrix is 
io calculated. 

Step 32403 For each one of the attributes A 1; A 2 , 
* , A n . a sum S v S 2 , , S n of correlation coef- 
ficients with respect to the other attributes is obtained. 
Here, the sum S l of the correlation coefficients of an a\- 
*s tribute Aj with respect to the other attributes is given as 
follows. 



Step 32404: For each one of the attributes A-, , A 2 s , 
A n; the discrete index x v x z , - , t n is obtained. 

Step S2405; For each one of the attributes A 1s A 2 , 

, A n , a product Q t , Q 2 , , Q rt of the sum S 2 , 

25 s n of the correlation coefficients and the discrete 

index t 1: x 2 , , t n is obtained. 

Step S2406: The highest priority order is assigned 
to an attribute A k which has the largest value Q k among 

the obtained products Q r Q 2 , >Q n . 

30 step S2407: Each one of the correlation coefficients 

C k1 , C k2; , C kn of the attribute A k with respect to 

the other attributes is extracted sequentially in a de- 
creasing order of their values. 

Step S2408: The subsequent priority order is se- 
35 quentially assigned to an attribute used in calculating 
each correlation coefficient, in an order by which each 
correlation coefficient is extracted. 

Next, the fifth case of assigning priority orders to 
attributes by merging the priority orders determined by 
40 the above described first case of using the maximum 
correlation and the above described second case of us- 
ing all correlation coefficients comprehensively will be 
described. 

Fig. 25 shows a flow chart for the attribute priority 
45 order assignment method merging the maximum corre- 
lation based case and the all correlation coefficients 
based case in this third embodiment, which proceeds 
as foliows. 

Step $2501; First, data are entered. 
so Step S2502: The correlation coefficient matrm is 
calculated 

Step S2503: The priority orders P1 for a plurality of 
attributes are obtained by using the attribute priority or- 
der assignment method using the maximum correlation, 
S5 as in the first case described above 

Step S2504: The priority orders P2 for a plurality of 
attributes are obtained by using the attribute priority or- 
der assignment method using aU correlation coefficients 
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comprehensively; as in the second case described 
above. 

Step S2505: For each one of the attributes, a sum 
of Pi and P2 is calculated, and the calculated sum is 
extracted sequentially in an increasing order of their vai- £ 
ues 

Step S2506: The priority order is sequentially as- 
signed to an attribute corresponding to each sum. in an 
order by which each sum is extracted. 

Thus, according to this third embodiment, it be- 70 
comes possible to select attributes which are to be used 
as data conversion target attributes, by utilizing the cor- 
relation coefficient matrix among data attributes as the 
statistical feature. 

As described, according to the present invention, 15 
is possible for the user to try visual expressions based 
on a feature of the decision tree which is effective in clas- 
sifying the target attributes, without being required to en- 
ter definition, at a time of large quantity multidimensional 
data analysis. In addition, when a feature or a tendency 20 
of the analysis target data is unknown, it is still possible 
to carry out the analysis according to visual expressions 
based on the feature of the decision tree for classifying 
the target attributes 

Also, according to the present invention, it is possh 25 
ble for the user to try visual expressions based on a sta- 
tistical feature of data, without be?ng required to enter 
definition, at a time of large quantity multidimensional 
data analysis. In addition, when a feature or a tendency 
of the analysis target data is unknown, it is still possible so 
to carry out the analysis according to visual expressions 
based on the statistical feature Moreover, it is also pos- 
sible to reflect the user's intention by allowing the user 
to select the definition from the visual expressions 
based on the statistical feature. 35 

Also, according to the present invention, it is possi- 
ble to automatically select attributes which are to be the 
visualization targets in the visual multidimensional data 
analysis tool, by utilizing the correlation coefficient ma- 
trix among data attributes, and therefore it becomes 40 
possible to assign the priority orders among the at- 
tributes according to the correlation coefficient matrix 
and selecting visualization target attributes from data 
and a number of attributes to be selected entered by the 
user at a t^me of large quantity multidimensional data 4B 
analysis. 

As a result, even when the analysis policy is unde- 
cided by the user, it is possible to select a plurality of 
correlated attributes and use them for trials. 

it is to be noted that the above described embodi- so 
ments according to the present invention may be con- 
veniently implemented using conventional general pur- 
pose digital computers programmed according to the 
teachings of the present specification, as will be appar- 
ent to those skilled in the computer art Appropriate soft- ss 
ware coding can readily be prepared by skined program- 
mers based on the teachings of the present disclosure, 
as will be apparent to those skilled in the software art. 



In particular any of the processing in the first to third 
embodiments described above can be conveniently im- 
plemented in forms of software package The program 
for the processing ot the definition generation assist- 
ance device of the first or second embodiment may be 
stored in a storage medium such as a disk device that 
can be activated according to the need, or stored in a 
portable storage medium such as floppy disk or CD- 
ROM that is to be installed into a personal computer of 
the user at a time of use 

Such a software package can be a computer pro- 
gram product which employs a storage medium includ- 
ing stored computer code which is used to program a 
computer to perform the disclosed function and process 
of the present invention. The storage medium may in- 
clude, but is not limited to, any type of conventional flop- 
py disks : optical disks, CD-ROMs, magneto-optical 
disks, ROMs, RAMs, EPROMs, EEPROMs, magnetic 
or optical cards, or any other suitable media for storing 
electronic instructions. 

It is also to be noted that the expression conditions 
of Fig. 4 and Fig. 1 3 used in describing the first and sec- 
ond embodiments are just examples, and types of pa- 
rameters and their orders given in the mapping column 
and the data conversion parameter columns may be 
suitable changed according to the need. 

It is also to be noted thai besides those already 
mentioned above, many modifications and variations of 
the above embodiments may be made without departing 
from the novel and advantageous features of the 
present invention. Accordingly, all such modifications 
and variations are intended to be included within the 
scope of the appended claims. 



Claims 

1. An automatic definition generatron device for auto- 
matically generating a data conversion definition to 
be used in an apparatus for converting data stored 
in database or files into graphic data according to 
the data conversion definition and displaying the 
graphic data, the device comprising; 

a feature extraction unit for extracting a data 
feature of the data from a scheme and contents 
of the database or files; and 
a definition generation unit for automatically 
generating the data conversion definition ac- 
cording to the data feature extracted by the fea- 
ture extraction unit, the data conversion defini- 
tion being formed by an attribute mapping def- 
inition defining combinations of data attributes 
and graphic data parameters and a data con- 
version method definition defining a method for 
converting a value of each data attribute into a 
value of a corresponding graphic data param- 
eter. 
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2. The device of claim 1 . wherein the definition gener- 
ation unit automaticaily generates the attribute 
mapping definition that gives an effective visual ex- 
pression, using a prescribed expression condition 
indicating the graphic data parameters to which the £ 
data attributes should be set in correspondence at 
higher priority orders, according to whether a data 
type of each data attribute to be a target ot conver- 
sion into graphic data is quantitative or qualitative. 

10 

3. The device of ciaim 2, herein the graphic data pa- 
rameters are parameters specifying X-axis. Y-axis, 
size, color iabel and shape of a graphic figure, and 
the expression condition specifies priority orders 
among said parameters by which the data attributes f$ 
are to be set in correspondence to said parameters, 
said priority orders being different orders depending 

on whether the data type of each data attribute is 
quantitative or qualitative. 

20 

4. The device of claim 1 , wherein the definition gener- 
ation unit automatically generates the data conver- 
sion method definition that gives an effective visual 
expression, using a prescribed expression condi- 
tion indicating conversion methods tor converting a 25 
value of each data attribute into a value of a corre- 
sponding graphic data parameter according to 
whether a data type of each data attribute to be a 
target of conversion into graphic data is quantitative 

or qualitative. 30 

5. The device of claim 4, wherein the conversion meth- 
ods includes a conversion by logical arrangement, 
a conversion by threshold specification, a conver- 
sion by uniform division specification, and a conver- 55 
sion by character string specification, and the ex- 
pression condition specifies the conversion meth- 
ods to be different methods depending on whether 
the diata type of each data attribute is quantitative 

or qualitative 40 

6. The device of claim 1, wherein the feature extrac- 
tion unit also assigns priority orders among data at- 
tributes to be targets of conversion into graphic data 
parameters using a prescribed data priority order 
assignment rule according to the data feature, and 
automatically derives attribute mapping combina- 
tion conditions indicating ordered sequences of the 
data attributes to be targets of conversion into 
graphic data parameters that give effective visual 50 
expressions. 

7. The device of claim 6, wherein the feature extrac- 
tion unit obtains a decision free for classifying target 
attributes with respect to the data, and extracts a 55 
feature of obtained decision tree as the data fea- 
ture 



8. The device of ciaim 7, wherein the feature ot the 
decision tree is a feature regarding vertices of the 
decision tree, and the data priority order assign- 
ment rule sequentially assigns the priority orders 
from those data attributes which are corresponding 
to upper level vertices. 

9. The device of claim 7, wherein the feature of the 
decision tree is a feature regarding vertices of the 
decision tree, and the data priority order assign- 
ment rule sequentially assigns the priority orders 
from those data attributes which are corresponding 
to lower level vertices. 

10. The device of claim 6, wherein the feature extrac- 
tion unit extracts a statistrcai feature related to the 
data as the data feature. 

11. The device of claim 10, wherein the statistical fea- 
ture is a feature regarding factors of the data or 
quantif ication theory result corresponding to factors 
of the data, and the data priority order assignment 
rule sequentially assigns the priority orders from a 
first factor among the factors of the data. 

12. The device of claim 10 : wherein the statistical fea- 
ture is a feature regarding factor loading of the data 
or quantification theory result corresponding to fac- 
tor loading of the data, and the data priority order 
assignment rule sequentially assigns the priority or- 
ders from those data attributes which are contained 
in a first factor among factors of the data. 

13. The device of ciaim 10 : wherein the statistical fea- 
ture is a feature regarding correlation coefficients 
among the data attributes or quantification theory 
result corresponding to correlation coefficients 
among the data attributes, and the data priority or- 
der assignment rule sequentially assigns the prior- 
ity orders from those data attributes which have 
higher correlation coefficients among the data. 

14. The device of ciarm 10. wherein the statistical fea- 
ture is a feature regarding dusters of the data or 
quantification theory result corresponding to clus- 
ters of the data ; and the data priority order assign- 
ment rule sequentially assigns the priority orders 
from the clusters in the data. 

15. The device of ciaim 10 ; wherein the statistical fea- 
ture is a correlation coefficient matrix indicating cor- 
relation coefficients among the data attributes . 

16. The device of claim 15. wherein the data priority or- 
der assignment rule assigns the priority orders ac- 
cording to a largest correlation coefficient in the cor- 
relation coefficient matrix and correlation coeffi- 
cients of one set of data attributes having sa*d iarg- 
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est correlation coefficient with respect to other data 
attributes. 

17. The device of claim 16 ; wherein the data priority or- 
der assignment rule assigns a highest priority order s 
to said one set of data attributes, calculates a prod- 
uct of each one of correlation coefficients of one of 
said one set of data attributes with respect to all oth- 
er data attributes and a corresponding one of cor- 
relation coefficients of another one of said one set to 
of data attributes with respect to ali other data at- 
tributes, eliminates those products which are ob- 
tained from correlation coefficients between said 
one set of data attributes, and sequentjally assigns 
subsequent priority orders to those data attributes 15 
for which a value of a product is larger among the 
other data attributes. 

18. The device of claim 1 5 : wherein the data priority or- 
der assignment rule sequentially assigns the prior- 20 
ity orders from those data attributes which have 
higher correlations with respect to all the data at- 
tributes comprehensively according to results of 
numerical calculation processing on correlation co- 
efficients between each data at-tribute and other 25 
data attributes in the correlation coefficient matrix. 

19. The device of claim IS, wherein the numerical cal- 
culation processing obtains a sum of correlation co- 
efficients of each data attribute with respect to other 30 
data attributes, and the data priority order assign- 
ment rule assigns a highest priority order to one da- 
ta attribute for which the sum is largest and sequen- 
tially assigns subsequent priority orders to those 
data attributes which have higher correlation coef- 35 
ficients with respect to said one data attribute. 

20. The device of claim 19, wherein the numerical cal- 
culation processing obtains the sum under a limita- 
tion depending on a prescribed number of data at- 40 
tributes to which the priority orders are to be as- 
signed. 

21. The device of claim 20, wherein the numerical cal- 
culation processing obtains the sum by summing as *s 
many as said prescribed number minus one of 
those correlation coefficients which have larger val- 
ues among correlation coefficients of each data at- 
tribute with respect to other data attributes. 

so 

22. The device of claim 15. wherein the data priority or- 
der assignment rule assigns the priority orders ac- 
cording to a discrete index indicating continuity of 
attribute vafue, said discrete index being calculated 
from a number of attribute value types and a ss 
number 01 records lor each data attribute 

23. The device of claim 22. wherein the data priority or- 



der assignment rule obtains a sum of correlation co- 
efficients of each data attribute with respect to other 
data attributes, obtains the discrete index for each 
data attribute by dividing the number of attribute val- 
ue types by the number of record, assigns a highest 
priority order to one data attribute for which a prod- 
uct of the sum and the discrete index is largest and 
sequentially assigns subsequent priority orders to 
those data attributes which have higher correlation 
coefficients with respect to said one data attribute 

24. The device of claim 1 5 S wherein the data priority or- 
der assignment rule assigns the priority orders by 
merging a first priority order obtained according to 
a largest correlation coefficient in the correlation co- 
efficient matrix and correlation coefficients of one 
set of data attributes having said largest correlation 
coefficient with respect to other data attributes, and 
a second priority order obtained according to results 
of numerical calculation processing on correlation 
coefficients between each data attribute and other 
data attributes in the correiation coefficient matrix 

25. The device of claim 24. wherein the data priority or- 
der assignment rufe sequentially assigns the prior- 
ity orders from those data attributes for which a sum 
of the first priority order and the second priority or- 
der is smaller among the data attributes. 

26. An automatic definition generation method for au- 
tomatically generating a data conversion definition 
to be used in an apparatus for converting data 
stored in database or files into graphic data accord- 
ing to the data conversion definition and displaying 
the graphic data, the method comprising the steps 
of; 

extracting a data feature of the data from a 
scheme and contents of the database or files: 
and 

automatically generating the data conversion 
definition according to the data feature extract- 
ed by the extracting step, the data conversion 
definition being formed by an attribute mapping 
definition defining combinations of data at- 
tributes and graphic data parameters and a da- 
ta conversion method definition defining a 
method for converting a value of each data at- 
tribute into a value of a corresponding graphic 
data parameter. 

27. The method of claim 26, wherein the automatically 
generating step automatically generates the at 
tribute mapping definition that gives an effective vrs- 
ual expression, using a prescribed expression con- 
dition indicating the graphic data parameters to 
which the data attributes should be set in corre- 
spondence at higher priority orders, according to 
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whether a data type of each data attribute to be a 
target of conversion into graphic data is quantitative 
or qualitative 

28- The method of claim 27, wherein the graphic data $ 
parameters are parameters specifying X-axis, Y-ax- 
is ; size, color, iabel and shape of a graphic figure, 
and the expression condition specifies priority or- 
ders among said parameters by whreh the data at- 
tributes are to be set in correspondence to said pa- 10 
rameters ; said priority orders being different orders 
depending on whether the data type of each data 
attribute is quantitative or qualitative. 

29. The method of ciaim 26 ; wherein the automatical '5 
generating step automatically generates the data 
conversion method definition that gives an effective 
visual expression, using a prescribed expression 
condition indicating conversion methods for con- 
verting a value of each data attribute into a value of 20 
a corresponding graphic data parameter according 

to whether a data type of each data attribute to be 
a target of conversion into graphic data is quantita- 
tive or qualitative. 

25 

30. The method of claim 29. wherein the conversion 
methods includes a conversion by logical arrange- 
ment, a conversion by threshold specification, a 
conversion by uniform division specification, and a 
conversion by character string specification, and 30 
the expression condition specifies the conversion 
methods to be different methods depending on 
whether the data type of each data attribute is quan- 
titative or qualitative. 

35 

31 . The method of claim 26 : wherein the extracting step 
also assigns priority orders among data attributes 
to be targets of conversion into graphic data param- 
eters using a prescribed data priority order assign- 
ment rule according to the data feature, and auto- 40 
matically derives attribute mapping combination 
conditions indicating ordered sequences of the data 
attributes to be targets of conversion into graphic 
data parameters that give effective visuat expres- 
sions, 45 

32* The method of claim 31 , wherein the extracting step 
obtains a decision tree for classifying target at- 
tributes with respect to the data, and extracts a fea- 
ture of obtained decision tree as the data feature. so 

33. The method of claim 32 : wherein the feature of the 
decision tree is a feature regarding vertices of the 
decision tree, and the data priority order assign- 
ment rule sequentially assigns the priority orders ss 
from those data attributes which are corresponding 
to upper level vertices. 



34. The method o! claim 32, wherein the feature of the 
decision tree is a teature regarding vertices of the 
decision tree, and the data priority order assign- 
ment rule sequentially assigns the priority orders 
from those data attributes which are corresponding 
to lower level vertices. 

35- The method of claim 31 , wherein the extracting step 
extracts a statistical feature related to the data as 
the data feature 

36. The method of claim 35 : wherein the statistical fea- 
ture is a feature regarding factors of the data or 
quantification theory result corresponding to factors 
of the data, and the data priority order assignment 
rule sequentially assigns the priority orders from a 
first factor among the factors of the data 

37. The method of claim 35, wherein the statistical fea- 
ture is a feature regarding factor loading of the data 
or quantification theory result corresponding to fac- 
tor loading of the data, and the data priority order 
assignment rule sequentially assigns the priority or- 
ders from those data attributes which are contained 
in a first factor among factors of the data. 

3B. The method of claim 35 s wherein the statistical fea- 
ture is a feature regarding correlation coefficients 
among the data attributes or quantification theory 
resuft corresponding to correlation coefficients 
among the data attributes : and the data priority or- 
der assignment rule sequentially assigns the prior- 
ity orders from those data attributes which have 
higher correlation coefficients among the data. 

39. The method of claim 35, wherein the statistical fea- 
ture is a feature regarding clusters of the data or 
quantification theory result corresponding to clus- 
ters of the data, and the data priority order assign- 
ment rule sequentially assigns the priority orders 
from the clusters in the data 

40. The method of claim 35 ; wherein the statistical fea- 
ture is a correlation coefficient matrix indicating cor- 
relation coefficients among the data attributes. 

41. The method of claim 40 : wherein the data priority 
order assignment ru^e assigns the priority orders 
according to a largest correlation coefficient in the 
correlation coefficient matrix and correlation coeffi- 
cients of one set of data attributes having said larg- 
est correlation coefficient with respect to other data 
attributes. 

42. The method of claim 41 wherein the data priority 
order assrgnment rule assigns a highest priority or- 
der to said one set of data attributes, calculates a 
product of each one of correlation coefficients of 
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one of said one set of data attributes with respect 
to aH other data attributes and a corresponding one 
of correlation coefficients of another one of said one 
set of data attributes with respect to ail other data 
attributes, eliminates those products which are ob- s 
tained from correlation coefficients between sard 
one set of data attributes, and sequentially assigns 
subsequent priority orders to those data attributes 
for which a value of a product is larger among the 
other data attributes w 

43. The method of claim 40 ; wherein the data priority 
order assignment rule sequentially assigns the pri- 
ority orders from those data attributes which have 
higher correlations with respect to ali the data at- 15 
tributes comprehensively, according to results of 
numerical calculation processing on correlation co- 
efficients between each data attribute and other da- 
ta attributes in the correlation coefficient matrix. 

20 

44. The method of claim 43, wherein the numerical cal- 
culation processing obtains a sum of correlation co- 
efficients of each data attribute with respect to other 
data attributes and the data priority order assign- 
ment rule assigns a highest pnority order to one da- 25 
ta attribute lor which the sum is largest., and sequen- 
tially assigns subsequent priority orders to those 
data attributes which have higher correlation coef- 
ficients with respect to said one data attribute 

30 

45. The method of claim 44. wherein the numerical cal- 
culation processing obtains the sum under a limita- 
tion depending on a prescribed number of data at- 
tributes to which the priority orders are to be as- 
signed. 35 

46. The method of claim 45, wherein the numerical cal- 
culation processing obtains the sum by summing as 
many as said prescribed number minus one of 
those corretation coefficients which have larger val- 40 
ues among correlation coefficients of each data at- 
trrbute with respect to other data attributes. 

47. The method of claim 40 ; wherein the data priority 
order assignment rule assigns the priority orders 45 
according \o a discrete index indicating continuity 

of attribute value, said discrete index being calcu- 
lated from a number of attribute value types and a 
number of records for each data attribute. 

so 

48. The method of claim 47 : wherein the data priority 
order assignment rule obtains a sum of correlation 
coefficients of each data attribute with respect to 
other data attributes., obtains the discrete index for 
each data attribute by dividing the number of at- ss 
tribute vaiue types by the number of record, assigns 

a highest priority order to one data attribute for 
which a product of the sum and the discrete index 



is largest, and sequentially assigns subsequent pri- 
ority orders to those data attributes which have 
higher correlation coefficients with respect to said 
one data attribute. 

49. The method of claim 40. wherein the data priority 
order assignment rute assigns the priority orders by 
merging a first priority order obtained according to 
a largest correlation coefficient in the correlation co- 
efficient matrix and correlation coefficients of one 
set of data attributes having said largest correlation 
coefficient with respect to other data attributes, and 
a second priority order obtained according to results 
of numerical calculation processing on correlation 
coefficients between each data attribute and other 
data attributes in the correlation coefficient matrix. 

50. The method of claim 49, wherein the data priority 
order assignment rule sequentially assigns the pri- 
ority orders from those data attributes for which a 
sum of the first priority order and the second priority 
order is smaller among the data attributes, 

51. An article of manufacture, comprising: 

a computer usable medium having computer 
readable program code means embodied therein 
for causing a computer to function as an automatic 
definition generation system for automatically gen- 
erating a data conversion definition to be used in an 
apparatus for converting data stored in database or 
files into graphic data according to the data conver- 
sion definition and displaying the graphic data, the 
computer readable program code means includes: 

first computer readable program code means 
for causing said computer to extract a data fea- 
ture of the data from a scheme and contents of 
the database or files; and 
second computer readable program code 
means for causing said computer to automati- 
cally generate the data conversion definition 
according to the data feature extracted by the 
first computer readable program code means, 
the data conversion definition being formed by 
an attribute mapping definition defining combi- 
nations of data attributes and graphic data pa- 
rameters and a data conversion method defini- 
tion defining a method for convening a value of 
each data attribute into a value of a correspond- 
ing graphic data parameter 

52. The article of manufacture of claim 51 , wherein the 
second computer readable program code means 
automatically generates the attribute mapping 6ei- 
inition that gives an effective visual expression, us- 
ing a prescribed expression condition indicating the 
graphic data parameters to which the data at- 
tributes should be set in correspondence at higher 
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priority orders, according to whether a data type of 
each data attribute to be a target of conversion into 
graphic data is quantitative or qualitative. 

53. The article of manufacture of claim 52, wherein the 
graphic data parameters are parameters specifying 
X-axis, Y-axis, size, color, label and shape of a 
graphic figure, and the expression condition speci- 
fies priority orders among said parameters by which 
the data attributes are to be set in correspondence 
to said parameters, sasd priority orders being differ- 
ent orders depending on whether the data type of 
each data attribute is quantitative or qualitative. 

54. The article of manufacture of claim 51 , wherein the 
second computer readable program code means 
automatically generates the data conversion meth- 
od definition that gives an effective visual expres- 
sion, using a prescribed expression condition indi- 
cating conversion methods for converting a value 
of each data attribute into a value of a correspond- 
ing graphic data parameter, according to whether a 
data type of each data attribute to be a target of con- 
version into graphic data is quantitative or qualita- 
tive 

55. The article of manufacture of ciaim 54. wherein the 
conversion methods includes a conversion by logi- 
cal arrangement, a conversion by threshold speci- 
fication, a conversion by uniform division specifica- 
tion, and a conversion by character string specifica- 
tion, and the expression condition specifies the con- 
version methods to be different methods depending 
on whether the data type of each data attribute is 
quantitative or qualitative 

56. The article of manufacture of claim 51 , wherein the 
first computer readable program code means also 
assigns priority orders among data attnbutes to be 
targets of conversion into graphic data parameters 
using a prescribed data priority order assignment 
rule according to the data feature, andautomatically 
derives attribute mapping combination conditions 
indicating ordered sequences of the data attributes 
to be targets of conversion into graphic data param- 
eters that give effective visual expressions 

57. The article of manufacture of claim 56, wherein the 
first computer readable program code means ob- 
tains a decision tree for classifying target attributes 
with respect to the data ; and extracts a feature of 
obtained decision tree as the data feature 

58. The artide of manufacture of claim 57, wherein the 
feature of the decision tree ts a feature regarding 
vertices of the decision tree, and the data priority 
order assignment rule sequential^ assigns the pri- 
ority orders from those data attributes which are 



corresponding to upper level vertices. 

59. The article of manufacture of claim 57 : wherein the 
feature of the decision tree is a feature regarding 
£ vertices of the decision tree, and the data priority 

order assignment rule sequentially assigns the pri- 
ority orders from those data attributes which are 
corresponding to lower level vertices. 

10 60. The article of manulacture of claim 56, wherein the 
first computer readable program code means ex- 
tracts a statistical feature related to the data as the 
data feature 

is 61. The article of manufacture of claim 80, wherein the 
statistical feature is a feature regarding factors of 
the data or quantification theory result correspond- 
ing to factors of the data, and the data priority order 
assignment rule sequentially assigns the priority or- 

20 ders from a first factor among the factors of the data 

62. The article of manufacture of claim 60 : wherein the 
statistical feature is a feature regarding factor load- 
ing of the data or quantification theory result corre- 

25 sponding to factor loading of the data, and the data 

priority order assignment rule sequentially assigns 
the priority orders from those data attnbutes which 
are contained in a first factor among factors of the 
data. 

30 

63. The article of manufacture of claim 80, wherein the 
statistical feature is a feature regarding correlation 
coefficients among the data attributes or quantifica- 
tion theory result corresponding to correlation coet- 

35 ficients among the data attributes, and the data pri- 

ority order assignment rule sequentially assigns the 
priority orders from those data attributes which have 
higher correlation coefficients among the data 

40 64. The article of manufacture of ciairn 60, wherein the 
statistical feature is a feature regarding clusters of 
the data or quantification theory result correspond- 
ing to clusters of the data, and the data priority order 
assignment rule sequentially assigns the priority or- 

45 ders from the clusters in the data. 

65, The article of manufacture of claim 60, wherein the 
statistical feature is a correlation coefficient matrix 
indicating correlation coefficients among the data 

50 attributes. 

66. The article of manufacture of claim 65, wherein the 
data priority order assignment rule assigns the pri- 
ority orders according to a largest correlation coef- 

55 ficient in the correlation coefficient matrix and cor- 
relation coefficients of one set of data attributes 
having said largest correlation coefficient with re- 
spect to other data attributes. 



14 



27 



EP 0 863 469 A2 



28 



67. The article of manufacture of claim 66 ; wherein the 
data priority order assignment rule assigns a high- 
est priority order to said one set of data attributes, 
calculates a product of each one of correlation co- 
efficients of one of said one set of data attributes s 
with respect to all other data attributes and a corre- 
sponding one of correlation coefficients of another 
one of said one set of data attributes with respect 

to all other data attributes, eliminates those prod- 
ucts which are obtained from correlation coeffi- to 
cients between said one set of data attributes, and 
sequentially assigns subsequent priority orders to 
those data attributes for which a value of a product 
is larger among the other data attributes. 

is 

68. The article of manufacture of claim 65 : wherein the 
data priority order assignment rufe sequentiaffy as- 
signs the prionty orders from those data attributes 
which have higher correlations with respect to all 

the data attributes comprehensively, according to 20 
results of numerical calculation processing on cor- 
relation coefficients between each data attribute 
and other data attributes in the correlation coeffi- 
cient matrix. 

2$ 

69. The article of manufacture of claim 66, wherein the 
numencal calculation processing obtains a sum of 
correlation coefficients ot each data attribute with 
respect toother data attributes, and the data priority 
order assignment rule assigns a highest priority or- 30 
der to one data attribute for which the sum is largest, 
and sequentially assigns subsequent priority orders 

to those data attributes which have higher correla- 
tion coefficients with respect to said one data at- 
tribute. 35 

70. The article of manufacture of claim 69, wherein the 
numerical calculation processing obtains the sum 
under a limitation depending on a prescribed 
number of data attributes to which the priority orders 40 

are to be assigned, 

71. The article of manufacture of claim 70, wherein the 
numerical calculation processing obtains the sum 

by summing as many as said prescribed number mi- 4S 

nus one of those correlation coefficients which have 
larger values among correlation coefficients of each 
data attribute with respect to other data attributes. 

72. The article ot manufacture of claim 65, wherein the so 
data priority order assignment rule assigns the pri- 
ority orders according to a discrete index indicating 
continuity of attribute va^ue, sajd discrete index be- 
ing calculated from a number of attribute value 
types and a number of records for each data at- 55 
tribute. 

73. The article of manufacture of claim 72, wherein the 



data priority order assignment rule obtains a sum of 
correlation coefficients of each data attribute with 
respect to other data attributes, obtains the discrete 
index for each data attribute by dividing the number 
of attribute value types by the number of record, as- 
signs a highest priority order to one data attribute 
tor which a product of the sum and the discrete in- 
dex is largest, and sequentially assigns subsequent 
priority orders to those data attributes which have 
higher correlation coefficients with respect to said 
one data attribute. 

74, The article of manufacture of claim 65, wherein the 
data prionty order assignment rute assigns the pri- 
ority orders by merging a first priority order obtained 
according to a largest correlation coefficient in the 
correlation coefficient matrix and correlation coeffi- 
cients of one set of data attributes having sa^d larg- 
est correlation coefficient with respect to other data 
attributes, and a second priority order obtained ac- 
cording to results of numerical calculation process- 
ing on correlation coefficients between each data 
attribute and other data attributes in the correlation 
coefficient matrix 

75. The article of manufacture of clam 74. wherein the 
data priority order assignment rule sequentially as- 
signs the priority orders from those data attributes 
for which a sum of the first priority order and the 
second priority order is smalter among the data at- 
tributes. 
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